Navigating the unexplored seascape of pre-miRNA candidates in single-genome approaches
Abstract
MOTIVATION The computational search for novel microRNA (miRNA) precursors often involves some form of structural analysis, with the aim of identifying which types of structure are likely to be recognized and processed by the cellular miRNA-maturation machinery. A natural way to tackle this problem is to cluster the candidate structures together with known miRNA precursor structures. Mixed clusters then allow the identification of candidates that are similar to known precursors. Given the large number of pre-miRNA candidates that can be identified in single-genome approaches, even after applying several filters for precursor robustness and stability, a conventional structural clustering approach is infeasible.

RESULTS We propose a method to represent candidate structures in a feature space that summarizes key sequence/structure characteristics of each candidate. We demonstrate that proximity in this feature space is related to sequence/structure similarity, and we select candidates that have a high similarity to known precursors. Additional filtering steps are then applied to further reduce the number of candidates to those with greater transcriptional potential. Our method is compared with another single-genome method (TripletSVM) on two datasets, showing better performance on one and, for larger training sets, comparable performance on the other. Additionally, we show that our approach allows for a better interpretation of the results.

AVAILABILITY AND IMPLEMENTATION The MinDist method is implemented using Perl scripts and is freely available at http://www.cravela.org/?mindist=1.

CONTACT [email protected]

SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
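The selection step described under RESULTS (keep candidates that lie close to known precursors in the feature space) can be illustrated with a minimal sketch. This is not the authors' Perl implementation. The three features, the Euclidean metric, the threshold value, and the names `min_distance` and `select_candidates` are all assumptions made for illustration; the abstract does not specify any of them.

```python
import math

def min_distance(candidate, known):
    """Smallest Euclidean distance from one candidate vector to any known-precursor vector."""
    return min(math.dist(candidate, k) for k in known)

def select_candidates(candidates, known, threshold):
    """Keep candidates whose nearest known precursor lies within `threshold`.

    Returns a dict mapping candidate name to its minimum distance.
    """
    selected = {}
    for name, vec in candidates.items():
        d = min_distance(vec, known)
        if d <= threshold:
            selected[name] = d
    return selected

# Hypothetical 3-feature vectors (for example GC fraction, paired-base
# fraction, scaled hairpin length); the real feature set is not given
# in the abstract.
known_precursors = [(0.45, 0.70, 0.80), (0.50, 0.65, 0.75)]
candidates = {
    "cand_A": (0.46, 0.69, 0.79),  # close to the first known precursor
    "cand_B": (0.90, 0.20, 0.10),  # far from every known precursor
}

# The threshold is an arbitrary illustrative value.
selected = select_candidates(candidates, known_precursors, threshold=0.1)
```

With these made-up vectors, only `cand_A` falls within the threshold. In practice the features would need to be scaled comparably before distances are meaningful, and the threshold would have to be chosen from data rather than fixed by hand.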
Similar resources
Identification of novel microRNA-like-coding sites on the long-stem microRNA precursors in Arabidopsis.
Plant microRNA (miRNA) is a crucial regulator of gene expression. It has been reported that more than one miRNA/miRNA duplex could be produced from a microRNA precursor (pre-miRNA). In this study, we performed a comprehensive search for the novel miRNA candidates on the pre-miRNAs of Arabidopsis. AGO1 enrichment, co-existence of the miRNA-like coordinates, and unique genome-wide match sites wer...
Navigating the currents of seascape genomics: how spatial analyses can augment population genomic studies
Population genomic approaches are making rapid inroads in the study of non-model organisms, including marine taxa. To date, these marine studies have predominantly focused on rudimentary metrics describing the spatial and environmental context of their study region (e.g., geographical distance, average sea surface temperature, average salinity). We contend that a more nuanced and considered app...
O-36: Genome Haplotyping and Detection of Meiotic Homologous Recombination Sites in Single Cells, A Generic Method for Preimplantation Genetic Diagnosis
Background: Haplotyping is invaluable not only to identify genetic variants underlying a disease or trait, but also to study evolution and population history as well as meiotic and mitotic recombination processes. Current genome-wide haplotyping methods rely on genomic DNA that is extracted from a large number of cells. Thus far random allele drop out and preferential amplification artifacts of...
I-45: FISH and Array CGH for PGD of Cancer
We developed several FISH approaches to enable preimplantation genetic diagnosis of cancer predisposition syndromes. An overview of the applications and the results of those PGDs will be provided. In addition we developed several novel tools to genome wide screen for CNVs and SNPs in single cells. Those technologies are now being applied for polar body, blastomere and blastocyst screening for c...
Bioinformatics identification of miRNA-mRNA regulatory network contributing to lung cancer invasion
Background: Over the past 15 years, significant insights have been gained into the roles of miRNAs in cancer. In various cancers, miRNAs can act as oncogenes or tumor suppressors, or can control metastasis by modulating the expression of numerous target genes. This study aims to determine the miRNA-mRNA molecular network regulating lung cancer invasion, using bioinformatics approaches. ...